Abstract

We consider the edge-based compartmental models for infectious disease spread introduced in [5]. 
These models allow us to consider standard SIR diseases spreading in random populations. In this paper 
we show how to handle deviations of the disease or population from the simplistic assumptions of [5].
We allow the population to have structure due to effects such as demographic detail or multiple types
of risk behavior, and the disease to have a more complicated natural history. We introduce these modifications
in the static network context, though it is straightforward to incorporate them into dynamic networks. 
We also consider serosorting, which requires using the dynamic network models. The basic methods we 
use to derive these generalizations are widely applicable, and so it is straightforward to introduce many 
other generalizations not considered here. 

1 Introduction 

In [5], we introduced the edge-based compartmental modeling approach for infectious disease spread. We
showed how to create models that account for social heterogeneity and contact duration simultaneously. 
However, there are many other important effects that govern the spread of disease through a population which 
were not considered. For example, it is well established that for diseases such as influenza, heterogeneities in
infectiousness/susceptibility and biases in mixing among age groups play an important role [12]. Further,
it is widely believed that for some diseases there are different stages of infection that have population scale 
impacts, such as a highly infectious early phase followed by a chronic, less infectious phase as in HIV [10]
or a latent uninfectious early phase followed by an active infectious phase as in Tuberculosis. If there is
structure within a population or within the progression of disease, it is important for our models to incorporate
the details.

Many of these effects have been studied under mass action mixing assumptions. In this paper we show 
that a range of assumptions can be reliably and simply captured by edge-based compartmental models. We 
begin by rederiving the simplest such model for Configuration Model networks. We then look at a number 
of different population or disease structures, summarized in Table 1, and show how to develop an edge-based
compartmental model incorporating these effects. The effects in the list are chosen to demonstrate the
flexibility of the approach. The list is by no means exhaustive, and it is not difficult to combine different effects.
These model populations are mostly static, and we describe them in the Configuration Model case, but it
is straightforward to adapt these effects to other static or dynamic networks. Because of its epidemiological 
importance, we end the paper by showing a model incorporating serosorting where the population structure 
changes in response to the disease. This is inherently a dynamic network process, so we must use a dynamic 
network model to study it. 

| Model | Brief Description | Section |
|---|---|---|
| Directed Networks | Model for a disease in which some contacts are not symmetric in terms of disease risk. | 3.1 |
| Heterogeneous Individuals | Model for populations with heterogeneities in infectiousness and/or susceptibility that do not correlate with population structure. | 3.2 |
| Assortative mixing by type | Model for populations with demographic groups that have heterogeneities in infectiousness and/or susceptibility where contacts are affected by an individual's group. | 3.3 |
| Multiple transmission modes | Model for a disease that can be transmitted by more than one type of behavior and the network structure induced by each behavior is different. | 3.4 |
| Multiple infectious stages | Model for a disease which has several infectious stages of possibly varying duration or infectiousness. | 3.5 |
| Serosorting | Model for dynamic network where edges break or are created at rates dependent on the status of partners. | 4.1, 4.2 |

Table 1: The edge-based compartmental models considered here. All of these except serosorting are presented
using the (static) Configuration Model network structure. Serosorting is presented in two different dynamic
network contexts.

2 The basic model 

The static models we present here are all generalizations of the basic Configuration Model (CM) static 
network epidemic model. To set the stage, we first define a CM network [6] [7] HI [1]. Because we will use the
same underlying approach, we also briefly describe the method of [5].

In a CM network, each individual is represented by a node which is connected to other nodes by edges 
which can transmit disease. To construct the network, each node is assigned a number of edges (its degree)
$k$ with probability $P(k)$. The edges connect randomly to other neighbors using proportional mixing, so that
the probability of selecting a neighbor of degree $k$ is $kP(k)/\langle K \rangle$, where $\langle K \rangle$ denotes the average of $k$. It is
convenient to define

\[ \psi(x) = \sum_k P(k) x^k . \]

Note that $\psi'(x) = \sum_k k P(k) x^{k-1}$ and $\psi'(1) = \langle K \rangle$.

When we considered the spread of epidemics in CM networks in our original paper [5], we used the flow
diagram of figure 1. Here $S(t)$ is the proportion of the population still susceptible, $I(t)$ is the proportion
infected, and $R(t)$ the proportion recovered. Assuming we know $S(t)$, the probability a random individual
is susceptible, we are able to determine $I$ and $R$ by $I = 1 - S - R$, $\dot{R} = \gamma I$.

To calculate the probability a random individual is susceptible, we choose a random test node u uniformly 
from the population. We alter u so that if infected, u does not transmit to its neighbors. This helps us 
assume the status of its neighbors are independent, but does not affect the probability u is susceptible. We 
use $\theta$ to be the probability that a random neighbor $v$ of $u$ has not transmitted infection to $u$, and we break $\theta$
into three parts: $\phi_S$, $\phi_I$, and $\phi_R$, which are the probability $v$ is still susceptible, the probability $v$ is infected
but has not transmitted to $u$, and the probability $v$ is recovered and did not transmit to $u$. Because the
infection rate is $\beta$ and the recovery rate is $\gamma$, it is relatively straightforward to see that the fluxes from $\phi_I$ to
$1 - \theta$ and $\phi_R$ are $\beta\phi_I$ and $\gamma\phi_I$ respectively. The calculation of the flux from $\phi_S$ to $\phi_I$ is less obvious. We
have two options. We can calculate the flux directly or we can calculate $\phi_S$ explicitly as was done in [5].
The first option is more general, but the second is simpler. 

The flow diagram of figure 1 gives

\[ \dot{\theta} = -\beta \phi_I \tag{1} \]

The remainder of our derivation focuses on finding $\phi_I$. Because $\theta = \phi_S + \phi_I + \phi_R$, we have $\phi_I = \theta - \phi_S - \phi_R$.
Thus we simply need to calculate $\phi_S$ and $\phi_R$ in terms of $\theta$ to find $\phi_I$ in terms of $\theta$. To calculate $\phi_S$,
we use the fact that the probability a neighbor $v$ has degree $k$ is $kP(k)/\langle K \rangle$. Since $u$ is prevented from
Figure 1: Flow diagram for Configuration Model networks. To find $\theta$, we must find $\phi_I$. We do this
by calculating $\phi_S$ explicitly, which is $\psi'(\theta)/\psi'(1)$. Because the fluxes into $\phi_R$ and $1 - \theta$ are proportional, we
can find $\phi_R$ in terms of $\theta$. We then use $\phi_I = \theta - \phi_S - \phi_R$ to find a differential equation for
$\dot{\theta}$ in terms of $\theta$.



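infecting $v$, the probability $v$ is susceptible given its degree $k$ is $\theta^{k-1}$. Thus $\phi_S$ is a weighted average of this,
$\phi_S = \sum_k kP(k)\theta^{k-1}/\langle K \rangle = \psi'(\theta)/\psi'(1)$.

To calculate $\phi_R$, we look at figure 1. The fluxes into $\phi_R$ and into $1 - \theta$ are proportional to one another, with
the proportionality coefficient $\gamma/\beta$. Since they both begin at 0, this means $\phi_R = \gamma(1 - \theta)/\beta$. Consequently

\[ \phi_I = \theta - \frac{\psi'(\theta)}{\psi'(1)} - \frac{\gamma(1-\theta)}{\beta} . \]

This leads to

\[ \dot{\theta} = -\beta\theta + \beta\frac{\psi'(\theta)}{\psi'(1)} + \gamma(1-\theta) . \]

Thus we have the system of equations

\[ \dot{\theta} = -\beta\theta + \beta\frac{\psi'(\theta)}{\psi'(1)} + \gamma(1-\theta) \tag{2} \]

\[ \dot{R} = \gamma I, \qquad S = \psi(\theta), \qquad I = 1 - S - R \tag{3} \]

which has just two ordinary differential equations (ODEs). In fact $\theta$ does not depend on (3), so the system
is governed by the single ODE (2).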
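
As an illustration of how equations (2) and (3) can be integrated, the following is a minimal sketch rather than the code used for the figures in this paper. It assumes a Poisson degree distribution with mean 5, so that $\psi(x) = e^{5(x-1)}$, the rates $\beta = 0.6$ and $\gamma = 1$, and the initial condition $\theta(0) = 0.999$; all of these values, the function names, and the fixed-step Runge-Kutta integrator are choices made only for this example.

```python
import math

BETA = 0.6         # per-edge transmission rate (assumed for illustration)
GAMMA = 1.0        # recovery rate (assumed for illustration)
MEAN_DEGREE = 5.0  # mean of the assumed Poisson degree distribution


def psi(x):
    """PGF of a Poisson degree distribution: psi(x) = exp(<K>(x - 1))."""
    return math.exp(MEAN_DEGREE * (x - 1.0))


def psi_prime(x):
    """First derivative of psi."""
    return MEAN_DEGREE * psi(x)


def derivatives(state):
    """Right-hand sides of equations (2) and (3) for state = (theta, R)."""
    theta, R = state
    S = psi(theta)
    I = 1.0 - S - R
    dtheta = (-BETA * theta
              + BETA * psi_prime(theta) / psi_prime(1.0)
              + GAMMA * (1.0 - theta))
    dR = GAMMA * I
    return (dtheta, dR)


def rk4_step(f, state, dt):
    """One classical fourth-order Runge-Kutta step."""
    k1 = f(state)
    k2 = f(tuple(s + 0.5 * dt * k for s, k in zip(state, k1)))
    k3 = f(tuple(s + 0.5 * dt * k for s, k in zip(state, k2)))
    k4 = f(tuple(s + dt * k for s, k in zip(state, k3)))
    return tuple(s + dt * (a + 2 * b + 2 * c + d) / 6.0
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))


def solve_basic_model(theta0=0.999, t_end=60.0, dt=0.01):
    """Integrate from theta(0) = theta0, R(0) = 0; return (theta, S, I, R) at t_end."""
    state = (theta0, 0.0)
    for _ in range(int(round(t_end / dt))):
        state = rk4_step(derivatives, state, dt)
    theta, R = state
    S = psi(theta)
    return theta, S, 1.0 - S - R, R


if __name__ == "__main__":
    theta, S, I, R = solve_basic_model()
    print(f"theta = {theta:.4f}, S = {S:.4f}, I = {I:.2e}, R = {R:.4f}")
```

Because $\theta$ evolves independently of $R$, the long-time value of $\theta$ is the root of the right-hand side of (2) below 1, and $S = \psi(\theta)$ then gives the final susceptible fraction.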



2.1 Generalizing the model 

In the following, we generalize the model for many static network situations. The basic approach is to 
consider a random test node which is prevented from causing infection. Then consider the edges which could 
transmit infection to it. We determine the probability the edges have not transmitted, which may depend 
on the test node, the neighbor, or details of the contact. Our approach to determining this probability is 
the same as above. Once we know the probability any given edge has not transmitted to the test node, we 
can calculate the probability that the test node is susceptible, from which we can calculate the proportion 
of the population that is susceptible, infected, or recovered. 



3 Epidemics on generalized static networks 
3.1 Directed Networks 

There are a number of realistic scenarios where infection can transmit in only one direction. Examples 
include blood transfusions, a restaurant cook infecting a patron, and even a patient infecting a doctor where 
Figure 2: Directed CM model. Flow diagram for a network with directed and nondirected edges. We
consider the two edge types separately. The evolution of edges is similar to figure 1. We can assign different
infection rates for each edge type.



they come into contact only because of the patient's infection. The probability and final size of epidemics 
for this scenario have been studied previously [3], but not the dynamics. 

We can investigate the dynamics in almost the same manner as before. Assume that the network consists 
of both directed and nondirected edges. The disease can transmit along a directed edge only following the 
edge direction, while nondirected edges may be followed in either direction. Let $\beta_d$ and $\beta_n$ denote the rate
of transmission along directed and nondirected edges respectively. Recovery occurs at rate $\gamma$ regardless of
how infection is transmitted. 

We refer to edges pointing to a node of interest as in-directed, and those pointing away as out-directed
edges. The probability of having $k_i$ in-, $k_o$ out-, and $k_n$ nondirected edges is given by $P(k_i,k_o,k_n)$. We
define

\[ \psi(x,y,z) = \sum_{k_i,k_o,k_n} P(k_i,k_o,k_n)\, x^{k_i} y^{k_o} z^{k_n} . \]

We again consider a random test node $u$ which is prevented from causing infection. We define $\theta_d(t)$ and
$\theta_n(t)$ to be the probability an in-directed edge or nondirected edge to $u$ respectively has not transmitted
infection to $u$. The probability that $u$ is still susceptible is $\psi(\theta_d, 1, \theta_n)$. We use the variables $\phi_{S,d}$, $\phi_{I,d}$, and
$\phi_{R,d}$ to be the equivalent of $\phi_S$, $\phi_I$, and $\phi_R$ seen before for in-directed edges. Following the same approach
as before we arrive at the flow diagrams in figure 2.

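Consider a neighbor $v$ with a directed edge to $u$. Because of how $v$ is chosen, the probability it has $k_i$
in-, $k_o$ out-, and $k_n$ nondirected edges is $k_o P(k_i,k_o,k_n)/\langle K_o \rangle$. Writing $\psi_y$ and $\psi_z$ for the partial
derivatives of $\psi$ with respect to its second and third arguments, the probability that $v$ is still susceptible
is $\phi_{S,d} = \sum_{k_i,k_o,k_n} k_o P(k_i,k_o,k_n)\theta_d^{k_i}\theta_n^{k_n}/\langle K_o \rangle = \psi_y(\theta_d,1,\theta_n)/\psi_y(1,1,1)$. The probability that $v$ has
recovered without infecting $u$ is $\phi_{R,d} = \gamma(1 - \theta_d)/\beta_d$. Because $\phi_{I,d} = \theta_d - \phi_{S,d} - \phi_{R,d}$ we have $\phi_{I,d}$ in terms
of $\theta_d$ and $\theta_n$. This gives us $\dot{\theta}_d$ in terms of $\theta_d$ and $\theta_n$. A similar expression holds for $\dot{\theta}_n$. We have

\[ \dot{\theta}_d = -\beta_d\theta_d + \beta_d\frac{\psi_y(\theta_d,1,\theta_n)}{\psi_y(1,1,1)} + \gamma(1-\theta_d) \tag{4} \]

\[ \dot{\theta}_n = -\beta_n\theta_n + \beta_n\frac{\psi_z(\theta_d,1,\theta_n)}{\psi_z(1,1,1)} + \gamma(1-\theta_n) \tag{5} \]

\[ \dot{R} = \gamma I, \qquad S = \psi(\theta_d,1,\theta_n), \qquad I = 1 - S - R \tag{6} \]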
Figure 3: Directed CM example. Results for the directed networks described in section 3.1.1 using $k = 4$,
$\beta_d = 0.2$, and $\beta_n = 0.4$. We choose $t = 0$ to correspond to 1% cumulative incidence. Theory (dashed)
correctly predicts dynamics of simulations in a population of 5 x 10^ individuals (solid).



to give us the proportion of susceptible, infected, and recovered individuals. 

If the system only has directed edges, then we can drop $\theta_n$ from the analysis and $\psi(x,y,z)$ reduces to
$\psi(x,y)$. Such a model could be used to study the impact of superspreaders where the probability of receiving
infection from an infected node is similar for all nodes (in-degrees are similar), but some nodes have many 
more neighbors to infect than others (high variance in out-degree). 



3.1.1 Example 

Consider a population for which the average in-degree, out-degree, and nondirected degrees are each $k$ as
follows: Each node has in-degree $k$. The out-degree $k_o$ is uniformly chosen from 0 up to $2k$ inclusive, and
the nondirected degree is $2k - k_o$. For this population,

\[ \psi(x,y,z) = x^k\, \frac{z^{2k+1} - y^{2k+1}}{(2k+1)(z-y)} . \]

Figure 3 shows results for $k = 4$, $\beta_d = 0.2$, and $\beta_n = 0.4$.
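
The closed form for $\psi(x,y,z)$ in this example is a finite geometric sum over the out-degree. The short sketch below, with hypothetical function names, checks the closed form against the direct sum over $k_o = 0, \ldots, 2k$ and confirms that the three average degrees all equal $k$.

```python
def psi_direct(x, y, z, k):
    """Direct sum over the 2k+1 equally likely out-degrees k_o = 0, ..., 2k.

    Each node has in-degree k, out-degree k_o, and nondirected degree 2k - k_o.
    """
    total = sum(y ** k_o * z ** (2 * k - k_o) for k_o in range(2 * k + 1))
    return x ** k * total / (2 * k + 1)


def psi_closed(x, y, z, k):
    """Closed form obtained from the geometric sum; requires y != z."""
    n = 2 * k + 1
    return x ** k * (z ** n - y ** n) / (n * (z - y))


def mean_degrees(k):
    """Average in-, out-, and nondirected degree for this population."""
    n = 2 * k + 1
    mean_out = sum(range(n)) / n
    mean_nondirected = sum(2 * k - k_o for k_o in range(n)) / n
    return k, mean_out, mean_nondirected


if __name__ == "__main__":
    print(psi_direct(0.9, 0.7, 0.8, 4), psi_closed(0.9, 0.7, 0.8, 4))
    print(mean_degrees(4))
```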



3.2 Heterogeneous infectiousness and susceptibility 

Assume now that there is a parameter $\alpha$ which measures a node's ability to become infected and cause
infection, but does not influence the contact structure of the population. We refer to the value of $\alpha$ for a
node as its type, and the probability a node has a given type $\alpha$ is $Q(\alpha)$. The recovery rate $\gamma_\alpha$ of a type-$\alpha$
node and the transmission rate $\beta_{\alpha,\alpha'}$ from a type-$\alpha'$ node to a type-$\alpha$ node are type dependent. The final
size for a special case of this model where a node's infectiousness and susceptibility are uncorrelated is given
ing]. 

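Consider a random test node $u$ of type $\alpha$. Let $\theta_{\alpha,\alpha'}$ denote the probability that an edge from a type-$\alpha'$
neighbor $v$ to $u$ has not transmitted infection from $v$ to $u$, and similarly $\phi_{S:\alpha,\alpha'}$ the probability $v$ is still
susceptible, $\phi_{I:\alpha,\alpha'}$ the probability $v$ is infected but the edge has not transmitted, and $\phi_{R:\alpha,\alpha'}$ the probability
that $v$ has recovered without transmitting. We define $\theta_\alpha = \sum_{\alpha'} \theta_{\alpha,\alpha'} Q(\alpha')$ as the probability that a random
edge to $u$ has not transmitted infection to $u$. We use the original definition of $\psi(x) = \sum_k P(k) x^k$.

We find that $v$ is susceptible with probability $\phi_{S:\alpha,\alpha'} = \sum_k kP(k)\theta_{\alpha'}^{k-1}/\langle K \rangle = \psi'(\theta_{\alpha'})/\psi'(1)$. We also
find that $\phi_{R:\alpha,\alpha'} = \gamma_{\alpha'}(1 - \theta_{\alpha,\alpha'})/\beta_{\alpha,\alpha'}$. Then the flow diagram in figure 4 shows that

\[ \dot{\theta}_{\alpha,\alpha'} = -\beta_{\alpha,\alpha'}\theta_{\alpha,\alpha'} + \beta_{\alpha,\alpha'}\frac{\psi'(\theta_{\alpha'})}{\psi'(1)} + \gamma_{\alpha'}(1 - \theta_{\alpha,\alpha'}) \tag{7} \]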
Figure 4: Heterogeneous infectiousness/susceptibility model. We separate nodes by type $\alpha$, but
assume that $\alpha$ has no effect on connectivity. Both infectiousness and susceptibility may depend on $\alpha$. We
must consider edges between each pair of types $\alpha$ and $\alpha'$ separately. The evolution of edges is similar to
before.

Figure 5: Heterogeneous infectiousness/susceptibility example. Epidemics spreading in a population
for which half have received a leaky vaccine. Vaccinated individuals are half as infectious and half as
susceptible. We choose $t = 0$ to correspond to 1% cumulative incidence. Simulations in a population of
5 x 10^ individuals (solid) and theory (dashed) are in good agreement.



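The probabilities a type-$\alpha$ node is still susceptible, infected, or recovered satisfy

\[ \dot{R}_\alpha = \gamma_\alpha I_\alpha, \qquad S_\alpha = \psi(\theta_\alpha), \qquad I_\alpha = 1 - S_\alpha - R_\alpha \tag{8} \]

The total population in each state is given by

\[ S = \sum_\alpha S_\alpha Q(\alpha), \qquad I = \sum_\alpha I_\alpha Q(\alpha), \qquad R = \sum_\alpha R_\alpha Q(\alpha) \tag{9} \]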

3.2.1 Example 

One application of this model is to the impact of a partially effective vaccination. Vaccination generally 
reduces the susceptibility of a node, but could either increase or decrease the infectiousness of a node by 
reducing the severity of symptoms (less sick individuals may shed less virus but also maintain stronger 
contact intensity while symptomatic). If only part of the population is vaccinated, then the population can 
be divided into those who have or have not received vaccination. 

Consider a population with a negative binomial degree distribution NB(3/2, 8/9) with size $r = 3/2$ and
probability $p = 8/9$, giving an average degree of $pr/(1-p) = 12$ and variance of $pr/(1-p)^2 = 108$. For a
Figure 6: Assortative mixing by type model. We separate nodes by type. We assume that type may
influence infectiousness and susceptibility as well as connections. For simplicity, we assume a finite number
of groups. The resulting system is similar to our result for correlated infectiousness and susceptibility, except
in the impact on contacts.



negative binomial distribution NB($r$, $p$) we have $\psi(x) = [(1-p)/(1-px)]^r$, so

\[ \psi(x) = (9 - 8x)^{-3/2} . \]

Assume that half of the population has received a leaky vaccine such that vaccinated nodes have reduced
susceptibility and, if infected, reduced infectiousness and infection duration. Let $\gamma$ be the rate of
recovery for unvaccinated nodes and $\beta$ the rate of infection between unvaccinated nodes. Vaccinated nodes
recover at rate $2\gamma$, and the rate of infection between a vaccinated and unvaccinated node (in either direction)
is $\beta/2$, while the rate of infection between two vaccinated nodes is $\beta/4$. The vaccine is distributed uniformly.
Results for $\beta = 0.3$, $\gamma = 0.5$ are shown in figure 5.
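
To make the structure of equations (7) and (8) concrete, the following sketch integrates them for this two-type vaccine example. It is an illustration under stated assumptions, not the code behind figure 5: the initial condition (every $\theta_{\alpha,\alpha'}$ set to 0.999 with no recovered nodes), the fixed-step Runge-Kutta integrator, and all function names are choices made here.

```python
Q = [0.5, 0.5]                 # type 0: unvaccinated, type 1: vaccinated
BETA0, GAMMA0 = 0.3, 0.5       # beta and gamma from the example
GAMMA = [GAMMA0, 2 * GAMMA0]   # vaccinated nodes recover twice as fast
# BETA[a][b]: transmission rate from a type-b node to a type-a node
BETA = [[BETA0, BETA0 / 2], [BETA0 / 2, BETA0 / 4]]


def psi(x):
    """PGF of NB(3/2, 8/9): psi(x) = (9 - 8x)^(-3/2)."""
    return (9.0 - 8.0 * x) ** -1.5


def psi_prime(x):
    """Derivative of psi: 12 (9 - 8x)^(-5/2)."""
    return 12.0 * (9.0 - 8.0 * x) ** -2.5


def derivatives(state):
    """state = [theta_00, theta_01, theta_10, theta_11, R_0, R_1]."""
    theta = [state[0:2], state[2:4]]
    R = state[4:6]
    # theta_a = sum_b theta_{a,b} Q(b): a random edge to a type-a node
    theta_avg = [sum(theta[a][b] * Q[b] for b in range(2)) for a in range(2)]
    out = []
    for a in range(2):
        for b in range(2):
            phi_S = psi_prime(theta_avg[b]) / psi_prime(1.0)
            out.append(-BETA[a][b] * theta[a][b] + BETA[a][b] * phi_S
                       + GAMMA[b] * (1.0 - theta[a][b]))      # equation (7)
    for a in range(2):
        I_a = 1.0 - psi(theta_avg[a]) - R[a]                  # equation (8)
        out.append(GAMMA[a] * I_a)
    return out


def rk4_step(f, y, dt):
    """One classical fourth-order Runge-Kutta step."""
    k1 = f(y)
    k2 = f([yi + 0.5 * dt * ki for yi, ki in zip(y, k1)])
    k3 = f([yi + 0.5 * dt * ki for yi, ki in zip(y, k2)])
    k4 = f([yi + dt * ki for yi, ki in zip(y, k3)])
    return [yi + dt * (a + 2 * b + 2 * c + d) / 6.0
            for yi, a, b, c, d in zip(y, k1, k2, k3, k4)]


def solve_vaccine_model(theta0=0.999, t_end=40.0, dt=0.01):
    """Return per-type lists (S, I, R) at time t_end."""
    y = [theta0] * 4 + [0.0, 0.0]
    for _ in range(int(round(t_end / dt))):
        y = rk4_step(derivatives, y, dt)
    theta_avg = [Q[0] * y[0] + Q[1] * y[1], Q[0] * y[2] + Q[1] * y[3]]
    S = [psi(t) for t in theta_avg]
    R = y[4:6]
    I = [1.0 - s - r for s, r in zip(S, R)]
    return S, I, R


if __name__ == "__main__":
    S, I, R = solve_vaccine_model()
    print("S by type:", S)
    print("total S:", sum(s * q for s, q in zip(S, Q)))   # equation (9)
```

The population totals of equation (9) are the $Q$-weighted averages of the per-type values, as in the last line of the usage example.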

3.3 Populations with assortative mixing by type 

In many instances there is biased mixing between or within demographic groups, and the transmission/recovery 
parameters for the different groups may differ. For example, the spread of influenza is strongly affected by
the increased level of mixing and increased infection rates between children. Many sexually transmitted 
diseases are strongly affected by differences in mixing rates and risk behavior among MSM and heterosexual 
groups. For this reason it is useful to have a model accommodating different levels of mixing within and 
between groups. The model we derive in this section is equivalent (though simpler) to that of [ITJ. 

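Assume that the population is made up of $M$ groups, and let $P_j(k_1, k_2, \ldots, k_M)$ denote the probability
a node of group $j$ has $k_l$ contacts with nodes of group $l$ for $l = 1, \ldots, M$. To simplify notation, we denote
this by $P_j(\mathbf{k})$ where $\mathbf{k} = (k_1, k_2, \ldots, k_M)$. We similarly set $\mathbf{x} = (x_1, x_2, \ldots, x_M)$ and use $\mathbf{x}^{\mathbf{k}}$ to denote
$x_1^{k_1} x_2^{k_2} \cdots x_M^{k_M}$. We set

\[ \psi_j(\mathbf{x}) = \sum_{\mathbf{k}} P_j(\mathbf{k})\, \mathbf{x}^{\mathbf{k}} \]

and let $\beta_{j,l}$ be the rate of transmission across an edge from group $l$ to group $j$. We similarly define $\gamma_j$ to be
the recovery rate of a group $j$ node. Define $\theta_{j,l}$ to be the probability an edge to a group $j$ node coming from
a group $l$ node has not transmitted infection.

If our test node $u$ is of type $j$, then the probability that a neighbor $v$ of type $l$ is still susceptible is

\[ \phi_{S:j,l} = \frac{\sum_{\mathbf{k}} k_j P_l(\mathbf{k})\, \theta_{l,1}^{k_1}\theta_{l,2}^{k_2} \cdots \theta_{l,j}^{k_j-1} \cdots \theta_{l,M}^{k_M}}{\sum_{\mathbf{k}} k_j P_l(\mathbf{k})} = \frac{\frac{\partial}{\partial x_j}\psi_l(\boldsymbol{\theta}_l)}{\frac{\partial}{\partial x_j}\psi_l(\mathbf{1})} \]

where $\boldsymbol{\theta}_l$ denotes the vector $(\theta_{l,1}, \theta_{l,2}, \ldots, \theta_{l,M})$ and $\mathbf{1}$ denotes the vector $(1, 1, \ldots, 1)$. We can also show that $v$ has recovered without
transmitting to $u$ with probability $\phi_{R:j,l} = \gamma_l(1 - \theta_{j,l})/\beta_{j,l}$.

Figure 6 gives

\[ \dot{\theta}_{j,l} = -\beta_{j,l}\theta_{j,l} + \beta_{j,l}\frac{\frac{\partial}{\partial x_j}\psi_l(\boldsymbol{\theta}_l)}{\frac{\partial}{\partial x_j}\psi_l(\mathbf{1})} + \gamma_l(1 - \theta_{j,l}) \tag{10} \]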
The denominator $\frac{\partial}{\partial x_j}\psi_l(\mathbf{1})$ is simply the average of $k_j$ for nodes of group $l$. We find

\[ \dot{R}_j = \gamma_j I_j, \qquad S_j(t) = \psi_j(\boldsymbol{\theta}_j), \qquad I_j = 1 - S_j - R_j \tag{11} \]

As a special case, we can consider a population where the number of contacts a node has with one group
is assigned independently of the number that node has with any other group. We set $P_{j,n}(k)$ to be the
probability a node of group $j$ has $k$ edges to nodes of group $n$ and define $\psi_{j,n}(x) = \sum_k P_{j,n}(k) x^k$. Then
$\psi_j(\boldsymbol{\theta}_j)$ factors and may be written as $\psi_{j,1}(\theta_{j,1})\psi_{j,2}(\theta_{j,2}) \cdots \psi_{j,M}(\theta_{j,M})$. In this special case we get

\[ \dot{\theta}_{j,l} = -\beta_{j,l}\theta_{j,l} + \beta_{j,l}\left[\frac{\psi'_{l,j}(\theta_{l,j})}{\psi'_{l,j}(1)} \prod_{n \neq j} \psi_{l,n}(\theta_{l,n})\right] + \gamma_l(1 - \theta_{j,l}) \tag{12} \]

3.3.1 Example 

To demonstrate the ability to capture demographic information, we consider a population made up of two 
groups, which we arbitrarily label children and adults. The between-group degrees are binomially distributed 
with Bi(4, 1/2), so that the average between-group degree is 2. An adult's within-group degree equals its 
between-group degree. In contrast, a child's within-group degree is given by 5 times its between-group 
degree. Thus people with higher between-group degree have higher within-group degree. We get 

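\[ \psi_a(x_a, x_c) = (0.5 x_a x_c + 0.5)^4 , \qquad \psi_c(x_a, x_c) = (0.5 x_a x_c^5 + 0.5)^4 . \]

We set the disease parameters to be

\[ \gamma_c = 1, \quad \gamma_a = 0.1, \quad \beta_{c,c} = 0.3, \quad \beta_{c,a} = 0.1, \quad \beta_{a,a} = 0.1, \quad \beta_{a,c} = 0.1 . \]

The results are shown in the top of figure 7.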

We repeat with the same parameters, but this time with the correlations in degree switched so that higher
between-group degrees imply lower within-group degrees. An adult's within-group degree is $4 - k$ where $k$
is its between-group degree. A child's within-group degree is $5(4 - k)$ where $k$ is its between-group degree.
We have

\[ \psi_a(x_a, x_c) = (0.5 x_a + 0.5 x_c)^4 , \qquad \psi_c(x_a, x_c) = (0.5 x_a + 0.5 x_c^5)^4 . \]

The results are shown in the bottom of figure 7.

The distributions of within-group and between-group contacts in the two populations are the same. The only
distinction is that the correlations of within and between-group contacts are different. This results in 
differences in the course of the epidemics. A mass action model could not distinguish between the populations. 
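
The claim that the two populations share the same marginal degree distributions and differ only in their correlations can be checked directly. The sketch below uses hypothetical helper names; it builds the joint (within-group, between-group) degree distribution for each scenario from the Bi(4, 1/2) between-group degree and compares marginals and covariances.

```python
from math import comb


def joint_distribution(within_given_between):
    """Map (within, between) degree pairs to probabilities.

    The between-group degree k is Bi(4, 1/2); the within-group degree is a
    deterministic function of k, as in the two example populations.
    """
    return {(within_given_between(k), k): comb(4, k) / 16 for k in range(5)}


def marginal(joint, index):
    """Marginal distribution of the within (index 0) or between (index 1) degree."""
    out = {}
    for degrees, prob in joint.items():
        out[degrees[index]] = out.get(degrees[index], 0.0) + prob
    return out


def covariance(joint):
    """Covariance between within-group and between-group degree."""
    mean_w = sum(w * p for (w, b), p in joint.items())
    mean_b = sum(b * p for (w, b), p in joint.items())
    mean_wb = sum(w * b * p for (w, b), p in joint.items())
    return mean_wb - mean_w * mean_b


adults_first = joint_distribution(lambda k: k)             # within = k
adults_second = joint_distribution(lambda k: 4 - k)        # within = 4 - k
children_first = joint_distribution(lambda k: 5 * k)       # within = 5k
children_second = joint_distribution(lambda k: 5 * (4 - k))

if __name__ == "__main__":
    print(marginal(adults_first, 0) == marginal(adults_second, 0))
    print(covariance(adults_first), covariance(adults_second))
    print(covariance(children_first), covariance(children_second))
```

The marginals agree in both scenarios while the covariance changes sign, which is exactly the information a mass action model discards.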

In both cases the disease spreads quickly through the child population. Early on the spread in adults 
is driven largely by the explosive growth in children. Because of the correlations of adults' within and 
between-group degrees, those adults who are infected by children in the first scenario tend to have more 
adult contacts and infect high-degree adults who in turn infect more high-degree children. In the second 
scenario however, infected children tend to infect fewer adults who tend to have fewer adult contacts. The 
disease does not grow as quickly, but it also decays less quickly because more high degree nodes remain. 

3.4 Multiple modes of transmission 

Rather than having different types of nodes, there may be multiple modes of transmission with different 
mixing and infection rates for each mode. For example, HIV may spread through sexual contact and needle- 
sharing. The sexual contact network may have little to no relation to the needle-sharing network. Assume 
Figure 7: Assortative mixing by type example. Comparison of theory and simulated results for mixing
with demographic groups. We also show the predicted levels of infection in each subgroup. Simulations in a
population of 5 x 10^ individuals (solid) and theory (dashed) are in good agreement. The differences between
the top and bottom results come from changing the correlations of within-group and between-group mixing. We choose
$t = 0$ to correspond to 1% cumulative incidence.

Figure 8: Multiple modes of transmission model. Flow diagram showing the flux of edges for the $j$-th
contact type for a disease which has multiple modes of transmission.

Figure 9: Multiple modes of transmission example. Disease spread in a population with three different
types of contacts, each with a different degree distribution. Simulations in a population of 5 x 10^ individuals
(solid) and theory (dashed) are in good agreement. We choose $t = 0$ to correspond to 1% cumulative
incidence.



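there are $M$ types of contacts and that the rate of transmission along contacts of mode $j$ is given by $\beta_j$. Let
the joint distribution of the number of each mode of contacts be given by $P(\mathbf{k})$ where $\mathbf{k} = (k_1, k_2, \ldots, k_M)$.
Assume recovery rates are independent of how infection was acquired. We set $\mathbf{x} = (x_1, \ldots, x_M)$ and denote
$x_1^{k_1} \cdots x_M^{k_M}$ by $\mathbf{x}^{\mathbf{k}}$. We define

\[ \psi(\mathbf{x}) = \sum_{\mathbf{k}} P(\mathbf{k})\, \mathbf{x}^{\mathbf{k}} . \]

We can apply the same method to each edge type as shown in figure 8. We set $\phi_{S,j}$ to be the probability
that an edge of type $j$ connects the test node to a susceptible neighbor. If $\theta_j$ is the probability a contact
of type $j$ has not yet transmitted infection, we set $\boldsymbol{\theta} = (\theta_1, \theta_2, \ldots, \theta_M)$. We find $\phi_{S,j} = \frac{\partial}{\partial x_j}\psi(\boldsymbol{\theta}) / \frac{\partial}{\partial x_j}\psi(\mathbf{1})$
and similarly the probability an edge of type $j$ connects to a recovered neighbor which did not transmit is
$\phi_{R,j} = \gamma(1 - \theta_j)/\beta_j$. As before we find $S(t) = \psi(\boldsymbol{\theta})$. We have

\[ \dot{\theta}_j = -\beta_j\theta_j + \beta_j\frac{\frac{\partial}{\partial x_j}\psi(\boldsymbol{\theta})}{\frac{\partial}{\partial x_j}\psi(\mathbf{1})} + \gamma(1 - \theta_j) \tag{13} \]

\[ \dot{R} = \gamma I, \qquad S = \psi(\boldsymbol{\theta}), \qquad I = 1 - S - R \tag{14} \]

In the case where the degree with respect to one contact type is independent of that with respect to another,
we can simplify equation (13) to be similar to equation (12).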



3.4.1 Example 

We consider a population with three different types of contacts. We take $k_1$, $k_2$, and $k_3$ to denote the
number of each type of contact a node has. We assume that contact type 1 is binomially distributed with
Bi(2, 0.5) (giving a mean of 1). Contact type 2 is geometrically distributed, with mean 2. Contact type 3
has a negative binomial distribution NB(1, 1/4) with mean 1/3 and variance 4/9. The numbers of contacts
an individual has of each type are assigned independently. We find

\[ \psi(\mathbf{x}) = \frac{(x_1 + 1)^2}{4} \cdot \frac{x_2}{2 - x_2} \cdot \frac{3}{4 - x_3} . \]

We take $\gamma = 1$ and set $\beta_1 = 1$, $\beta_2 = 0.5$, and $\beta_3 = 3$ for each contact type. We compare simulation and
theory in figure 9.
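
Because the three contact types are independent, $\psi$ is a product of one-variable generating functions, and the stated means and variances can be recovered from derivatives at 1: the mean is $\psi_j'(1)$ and the variance is $\psi_j''(1) + \psi_j'(1) - \psi_j'(1)^2$. The sketch below checks this numerically with central finite differences; the function names and the step size are choices made for this illustration.

```python
def psi_1(x):
    """PGF of Bi(2, 0.5)."""
    return (x + 1.0) ** 2 / 4.0


def psi_2(x):
    """PGF of a geometric distribution on 1, 2, ... with mean 2."""
    return x / (2.0 - x)


def psi_3(x):
    """PGF of NB(1, 1/4)."""
    return 3.0 / (4.0 - x)


def psi(x1, x2, x3):
    """Joint PGF for independently assigned contact types."""
    return psi_1(x1) * psi_2(x2) * psi_3(x3)


def mean_and_variance(pgf, h=1e-4):
    """Estimate mean and variance from central differences of the PGF at 1."""
    d1 = (pgf(1.0 + h) - pgf(1.0 - h)) / (2.0 * h)
    d2 = (pgf(1.0 + h) - 2.0 * pgf(1.0) + pgf(1.0 - h)) / (h * h)
    return d1, d2 + d1 - d1 ** 2


if __name__ == "__main__":
    for name, pgf in (("type 1", psi_1), ("type 2", psi_2), ("type 3", psi_3)):
        mean, var = mean_and_variance(pgf)
        print(f"{name}: mean = {mean:.4f}, variance = {var:.4f}")
```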
Figure 10: Multiple infectious stages model. Flow diagram for a disease with several infected stages.
When a disease progresses through several states (or has an infectious period that is not exponentially 
distributed) it is convenient to use a stage-progression model to represent the state of an edge. 



3.5 Multiple infectious stages 

There are a number of diseases with multiple infectious stages such as Tuberculosis and HIV. Some diseases 
begin with a non-infectious latent phase, some begin with a highly infectious acute stage before settling into 
a long-term chronic stage, and others oscillate between phases of high and low infectiousness. To model 
such situations we adapt a standard chain progression model, for which there are M infectious phases shown 
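in figure 10. We are not able to explicitly solve for all variables in terms of $\theta$, so we must find the fluxes
between the compartments. We can still find $\phi_S = \psi'(\theta)/\psi'(1)$, so we are able to find $\phi_{I,1}$ in terms of the
other variables using $\phi_S + \phi_R + \sum_j \phi_{I,j} = \theta$. We obtain

\[ \dot{\theta} = -\sum_{j=1}^{M} \beta_j \phi_{I,j} \tag{15} \]

\[ \phi_{I,1} = \theta - \frac{\psi'(\theta)}{\psi'(1)} - \phi_R - \sum_{j=2}^{M} \phi_{I,j} \tag{16} \]

\[ \dot{\phi}_{I,j} = \gamma_{j-1}\phi_{I,j-1} - \gamma_j\phi_{I,j} - \beta_j\phi_{I,j}, \qquad M \geq j > 1 \tag{17} \]

\[ \dot{\phi}_R = \gamma_M \phi_{I,M} \tag{18} \]

\[ \dot{I}_j = \gamma_{j-1} I_{j-1} - \gamma_j I_j, \qquad M \geq j > 1 \tag{19} \]

\[ \dot{R} = \gamma_M I_M, \qquad S = \psi(\theta), \qquad I_1 = 1 - S - R - \sum_{j=2}^{M} I_j \tag{20} \]

where $I_j$ is the proportion of the population in the $j$-th infectious class.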



3.5.1 Example 

Consider now the spread of a disease for which there are three infectious stages. The first stage is moderately 
infectious but not long, with $\beta_1 = 0.2$, $\gamma_1 = 1$. The second stage is much longer, but has a substantially lower
infectiousness, with $\beta_2 = 0.01$ and $\gamma_2 = 0.08$. The final stage has an intermediate duration but substantially
higher infectiousness, with $\beta_3 = 2$ and $\gamma_3 = 0.4$. We assume that the disease spreads in a population with
Figure 11: Multiple infectious stages example. The spread of the disease described in section 3.5.1 with three
infectious stages. Simulations in a population of 5 x 10^ individuals (solid) and theory (dashed) are in good
agreement. We choose $t = 0$ to correspond to 1% cumulative incidence.

degree distribution NB(1, 4/5) having mean 4 and variance 20 with

\[ \psi(x) = (5 - 4x)^{-1} . \]

The results are shown in figure 11.
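
A minimal sketch of how the stage-progression system (15) to (20) might be integrated for this three-stage example follows. It is not the code used for figure 11. The initial condition ($\theta(0) = 0.999$ with every edge and node compartment other than the first infectious stage empty), the fixed-step Runge-Kutta integrator, and the function names are assumptions made for illustration. The first-stage quantities $\phi_{I,1}$ and $I_1$ are obtained algebraically, as in (16) and (20).

```python
BETA = (0.2, 0.01, 2.0)    # transmission rates of stages 1, 2, 3
GAMMA = (1.0, 0.08, 0.4)   # rates of leaving stages 1, 2, 3


def psi(x):
    """PGF of NB(1, 4/5): psi(x) = 1 / (5 - 4x)."""
    return 1.0 / (5.0 - 4.0 * x)


def psi_prime(x):
    """Derivative of psi."""
    return 4.0 / (5.0 - 4.0 * x) ** 2


def derivatives(state):
    """state = (theta, phi_I2, phi_I3, phi_R, I2, I3, R)."""
    theta, phi_I2, phi_I3, phi_R, I2, I3, R = state
    phi_S = psi_prime(theta) / psi_prime(1.0)
    phi_I1 = theta - phi_S - phi_R - phi_I2 - phi_I3        # equation (16)
    I1 = 1.0 - psi(theta) - R - I2 - I3                     # equation (20)
    return (
        -(BETA[0] * phi_I1 + BETA[1] * phi_I2 + BETA[2] * phi_I3),  # (15)
        GAMMA[0] * phi_I1 - (GAMMA[1] + BETA[1]) * phi_I2,          # (17), j = 2
        GAMMA[1] * phi_I2 - (GAMMA[2] + BETA[2]) * phi_I3,          # (17), j = 3
        GAMMA[2] * phi_I3,                                          # (18)
        GAMMA[0] * I1 - GAMMA[1] * I2,                              # (19), j = 2
        GAMMA[1] * I2 - GAMMA[2] * I3,                              # (19), j = 3
        GAMMA[2] * I3,                                              # (20)
    )


def rk4_step(f, y, dt):
    """One classical fourth-order Runge-Kutta step."""
    k1 = f(y)
    k2 = f(tuple(yi + 0.5 * dt * ki for yi, ki in zip(y, k1)))
    k3 = f(tuple(yi + 0.5 * dt * ki for yi, ki in zip(y, k2)))
    k4 = f(tuple(yi + dt * ki for yi, ki in zip(y, k3)))
    return tuple(yi + dt * (a + 2 * b + 2 * c + d) / 6.0
                 for yi, a, b, c, d in zip(y, k1, k2, k3, k4))


def solve_staged_model(theta0=0.999, t_end=200.0, dt=0.02):
    """Return (S, [I1, I2, I3], R) at time t_end."""
    y = (theta0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0)
    for _ in range(int(round(t_end / dt))):
        y = rk4_step(derivatives, y, dt)
    theta, _, _, _, I2, I3, R = y
    S = psi(theta)
    return S, [1.0 - S - R - I2 - I3, I2, I3], R


if __name__ == "__main__":
    S, I, R = solve_staged_model()
    print(f"S = {S:.4f}, I = {I}, R = {R:.4f}")
```

The long horizon reflects the slow second stage, whose mean duration is $1/\gamma_2 = 12.5$ time units.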



4 Dynamic Networks and Serosorting 

For some diseases, it is not uncommon for individuals to actively seek out contacts of similar disease status 
(as in HIV [9] or Leprosy [2]) or even discordant status (as in "chicken pox parties" or "swine flu parties"). 
This is commonly known as serosorting. To study these populations, we must use dynamic network models, 
which we developed in [5].

We study serosorting in two models. In the first, we use an "actual degree" model where an individual 
has a given number of potential contacts, of which only a proportion are active at any given time. In the 
second, we use an "expected degree" model in which individuals break any existing contacts at a fixed rate, 
but different individuals may create contacts at differing rates leading to a variation in the expected number 
of contacts across the population. 

For simplicity, we will assume that there is no recovered class, and once infected an individual remains 
infected. This restriction is easily removed, but by using it, we are able to simplify the model and reduce the 
number of parameters needed. We again consider a test node, and as before we assume that if infected the 
test node does not cause any infections. We make an additional assumption that if infected the test node 
continues to behave as if it were susceptible, and that its potential neighbors treat it as if it were susceptible. 
We can think of the test node as an individual who is immune, but is unaware of that immunity, and we track 
the probability that the test node has not yet received a sufficient dose to infect a non-immune individual. 



4.1 Actual Degree Serosorting model 

In the actual degree formulation, we think of an individual as having k stubs or half-edges. These stubs may 
be active (and connected to another node's stub) or dormant (and available to form new edges). 

When an edge breaks, the corresponding stubs enter a dormant phase. We assume that the rate a dormant
stub belonging to a susceptible individual finds a new susceptible neighbor is $\eta_{1,SS}$, and the rate it finds a
new infected neighbor is $\eta_{1,SI}$. These may depend on the density of susceptible and infected individuals in
the population. Similarly, active edges break at rates depending on the status of the nodes. Edges between
susceptible nodes break at rate $\eta_{2,SS}$, edges between a susceptible and an infected node break at rate $\eta_{2,SI}$,
and edges between infected nodes break at rate $\eta_{2,II}$.

We define $\phi_S$, $\phi_I$, and $\phi_D$ to be the probability that a stub belonging to the test node $u$ has never
been part of an edge that transmitted infection to $u$, and that the stub is currently connected to a susceptible node,
connected to an infected node, or dormant, respectively. The fluxes between these states are shown in figure 12.
Unlike in previous cases, we are unable to explicitly calculate $\phi_S$, so we must track the flux into and out
of $\phi_S$. The fluxes between $\phi_S$ and $\phi_D$ are straightforward. However, the flux from $\phi_S$ to $\phi_I$ requires more
attention. We repeat our derivation from [5]. Consider a neighbor $v$ of the test node $u$ having the following
properties: the stub belonging to $u$ never transmitted to $u$ prior to joining with the stub belonging to $v$,
and similarly the stub belonging to $v$ never transmitted to $v$ prior to joining with the stub belonging to $u$.
Given this, the probability $v$ is susceptible is $q = \sum_k kP(k)\theta^{k-1}/\langle K \rangle = \psi'(\theta)/\psi'(1)$. Thus, given that $v$ is
susceptible, the rate $v$ becomes infected is

\[ -\frac{\dot{q}}{q} = -\frac{\dot{\theta}\,\psi''(\theta)/\psi'(1)}{\psi'(\theta)/\psi'(1)} = \beta\phi_I\frac{\psi''(\theta)}{\psi'(\theta)} . \]

Thus the flux from $\phi_S$ to $\phi_I$ is $\beta\phi_S\phi_I\psi''(\theta)/\psi'(\theta)$, the product of $\phi_S$, the probability that a stub has not
transmitted infection to the test node and connects to a susceptible node, with $\beta\phi_I\psi''(\theta)/\psi'(\theta)$, the rate
that the neighbor becomes infected given that the stub has not transmitted and connects to a susceptible
node.

We need to account for the number of stubs that are in edges between different classes of nodes or 
are dormant. We use $M_{SS}$ to be the proportion of all stubs that are in edges between susceptible nodes. 
Equivalently this is the probability that a stub is active, connects to a susceptible node, and belongs to a 
susceptible node: $M_{SS} = \phi_S \psi'(\theta)/\psi'(1)$. We similarly define $M_{SI}$ to be the proportion of all stubs that 
are in edges between a susceptible and an infected node. We calculate this by finding the probability a stub is 
active, connects to an infected node, and belongs to a susceptible node, but we must multiply by 2 because 
this only counts one stub in each edge. We get $M_{SI} = 2 \phi_I \psi'(\theta)/\psi'(1)$. We also define $M_{II}$ to be the 
proportion of all stubs in edges between infected nodes. We will calculate its value later. We set $\pi_S$ to be 
the proportion of stubs that are dormant and belong to susceptible nodes and $\pi_I$ to be the proportion of 
stubs that are dormant and belong to infected nodes. The value of $\pi_S$ can be calculated as for the dormant 
contact case to be $\pi_S = \phi_D \psi'(\theta)/\psi'(1)$. The value of $\pi_I$ is calculated by finding the fluxes out of the other 
states. Once we have all of these variables, we have $M_{II} = 1 - M_{SS} - M_{SI} - \pi_S - \pi_I$. 
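
The bookkeeping above can be sketched in a few lines of Python. This is only an illustration: the Poisson degree distribution (so that $\psi(x) = e^{\lambda(x-1)}$), the parameter name `mean_degree`, and the function names are assumptions made for the example and do not come from the derivation itself.

```python
import math


def psi_prime(x, mean_degree):
    """psi'(x) for an ASSUMED Poisson degree distribution, psi(x) = exp(mean_degree*(x-1))."""
    return mean_degree * math.exp(mean_degree * (x - 1.0))


def stub_proportions(theta, phi_S, phi_I, phi_D, pi_I, mean_degree):
    """Return (M_SS, M_SI, M_II, pi_S) using the closed-form expressions in the text."""
    # psi'(theta)/psi'(1): weight converting test-node stub probabilities
    # into proportions of all stubs that belong to susceptible nodes.
    weight = psi_prime(theta, mean_degree) / psi_prime(1.0, mean_degree)
    M_SS = phi_S * weight
    M_SI = 2.0 * phi_I * weight  # factor 2: both stubs of each S-I edge are counted
    pi_S = phi_D * weight
    M_II = 1.0 - M_SS - M_SI - pi_S - pi_I  # all remaining stubs
    return M_SS, M_SI, M_II, pi_S
```

In a disease-free state ($\theta = 1$, $\phi_I = 0$, $\pi_I = 0$) the weight is 1, so the proportions reduce to $M_{SS} = \phi_S$ and $\pi_S = \phi_D$, with nothing left over for $M_{SI}$ or $M_{II}$.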

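Following figure 12 we find 

$$ \dot{\theta} = -\beta \phi_I \tag{21} $$

$$ \dot{\phi}_S = \eta_{1,SS}(\pi_S,\pi_I)\,\phi_D - \eta_{2,SS}(\pi_S,\pi_I)\,\phi_S - \beta \phi_I \phi_S \frac{\psi''(\theta)}{\psi'(\theta)} \tag{22} $$

$$ \dot{\phi}_I = \eta_{1,SI}(\pi_S,\pi_I)\,\phi_D - \eta_{2,SI}(\pi_S,\pi_I)\,\phi_I - \beta \phi_I + \beta \phi_I \phi_S \frac{\psi''(\theta)}{\psi'(\theta)} \tag{23} $$

$$ \dot{\phi}_D = \eta_{2,SS}(\pi_S,\pi_I)\,\phi_S + \eta_{2,SI}(\pi_S,\pi_I)\,\phi_I - [\eta_{1,SS}(\pi_S,\pi_I) + \eta_{1,SI}(\pi_S,\pi_I)]\,\phi_D \tag{24} $$

$$ \pi_S = \phi_D \frac{\psi'(\theta)}{\psi'(1)} \tag{25} $$

$$ \dot{\pi}_I = \frac{\eta_{2,SI}(\pi_S,\pi_I)}{2} M_{SI} - \pi_S\,\eta_{1,SI}(\pi_S,\pi_I) + \eta_{2,II}(\pi_S,\pi_I)\,M_{II} - \pi_I\,\eta_{1,II} + \beta \phi_D \phi_I \frac{\psi''(\theta)}{\psi'(1)} \tag{26} $$

$$ M_{SS} = \phi_S \frac{\psi'(\theta)}{\psi'(1)} \tag{27} $$

$$ M_{SI} = 2 \phi_I \frac{\psi'(\theta)}{\psi'(1)} \tag{28} $$

$$ M_{II} = 1 - M_{SS} - M_{SI} - \pi_S - \pi_I \tag{29} $$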
Figure 12: Fixed-degree serosorting model. Flow diagram showing the interplay involved in serosorting 
with fixed-degree. We do not consider a recovered class, which simplifies the equations. The framework can 
be adapted to include a recovered class. The $M$ variables represent the total proportion of stubs involved 
in edges between the two types and the $\pi$ variables are the proportion of dormant stubs belonging to nodes 
of each type. The $\phi$ variables are as before. For the right hand side, we are able to determine most of the 
variables analytically, so we only need the fluxes into and out of $\pi_I$. We expect that the edge breaking and 
rejoining rates $\eta$ will depend on values of $\pi_S$ and $\pi_I$. 

where $\eta_{1,SS}$, $\eta_{1,SI}$, $\eta_{1,II}$, $\eta_{2,SS}$, $\eta_{2,SI}$, and $\eta_{2,II}$ are likely to depend on $\pi_I$ and $\pi_S$ and may depend on the 
other quantities. What form that dependence takes is determined by the behavior of the population. This 
completes the derivation of the equations for the actual degree formulation of serosorting. 
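
To make the structure of equations (21)-(29) concrete, the following sketch integrates them with a hand-written fourth-order Runge-Kutta step. Everything specific in it is an assumption chosen for illustration rather than part of the model: the Poisson degree distribution, the mass-action form of the joining rates ($\eta_{1,SS} = c_{SS}\pi_S$, $\eta_{1,SI} = c_{SI}\pi_I$, $\eta_{1,II} = c_{II}\pi_I$), the constant breaking rates, and all numerical values in `PARAMS` and `STATE0`.

```python
import math


def psi_d1(x, lam):
    """psi'(x) for an ASSUMED Poisson degree distribution with mean lam."""
    return lam * math.exp(lam * (x - 1.0))


def psi_d2(x, lam):
    """psi''(x) for the same assumed Poisson distribution."""
    return lam * lam * math.exp(lam * (x - 1.0))


def rhs(state, p):
    """Right-hand sides of eqs. (21), (22), (23), (24) and (26)."""
    theta, phi_S, phi_I, phi_D, pi_I = state
    lam, beta = p["lam"], p["beta"]
    w = psi_d1(theta, lam) / psi_d1(1.0, lam)
    pi_S = phi_D * w                          # eq. (25)
    M_SS = phi_S * w                          # eq. (27)
    M_SI = 2.0 * phi_I * w                    # eq. (28)
    M_II = 1.0 - M_SS - M_SI - pi_S - pi_I    # eq. (29)
    # ASSUMPTION: joining rates proportional to the available dormant partner stubs.
    e1_SS = p["c_SS"] * pi_S
    e1_SI = p["c_SI"] * pi_I
    e1_II = p["c_II"] * pi_I
    # ASSUMPTION: constant breaking rates.
    e2_SS, e2_SI, e2_II = p["e2_SS"], p["e2_SI"], p["e2_II"]
    infect = beta * phi_I * phi_S * psi_d2(theta, lam) / psi_d1(theta, lam)
    d_theta = -beta * phi_I
    d_phi_S = e1_SS * phi_D - e2_SS * phi_S - infect
    d_phi_I = e1_SI * phi_D - e2_SI * phi_I - beta * phi_I + infect
    d_phi_D = e2_SS * phi_S + e2_SI * phi_I - (e1_SS + e1_SI) * phi_D
    d_pi_I = (0.5 * e2_SI * M_SI - pi_S * e1_SI + e2_II * M_II - pi_I * e1_II
              + beta * phi_D * phi_I * psi_d2(theta, lam) / psi_d1(1.0, lam))
    return [d_theta, d_phi_S, d_phi_I, d_phi_D, d_pi_I]


def rk4_step(f, state, dt, p):
    """One classical Runge-Kutta step for the system state' = f(state, p)."""
    k1 = f(state, p)
    k2 = f([s + 0.5 * dt * k for s, k in zip(state, k1)], p)
    k3 = f([s + 0.5 * dt * k for s, k in zip(state, k2)], p)
    k4 = f([s + dt * k for s, k in zip(state, k3)], p)
    return [s + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
            for s, a, b, c, d in zip(state, k1, k2, k3, k4)]


def simulate(state0, p, t_end=20.0, dt=0.01):
    """Integrate from t = 0 to t_end and return the final state."""
    state = list(state0)
    for _ in range(int(round(t_end / dt))):
        state = rk4_step(rhs, state, dt, p)
    return state


# Illustrative values only; susceptibles break S-I contacts faster than S-S contacts.
PARAMS = {"lam": 5.0, "beta": 0.5, "c_SS": 2.0, "c_SI": 0.5, "c_II": 2.0,
          "e2_SS": 0.2, "e2_SI": 2.0, "e2_II": 0.2}
# State order: theta, phi_S, phi_I, phi_D, pi_I (chosen so that M_II >= 0 at t = 0).
STATE0 = [0.99, 0.78, 0.01, 0.20, 0.01]
```

Summing equations (22)-(24) gives $\dot{\phi}_S + \dot{\phi}_I + \dot{\phi}_D = -\beta\phi_I = \dot{\theta}$, so a run started with $\phi_S + \phi_I + \phi_D = \theta$ should preserve that identity, which provides a simple check on any implementation.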

4.2 Variable Degree Serosorting model 

In many populations, it is reasonable to assume that individuals create and break contacts without regard to 
whether contacts already exist. Consequently, the concept of having a fixed number of stubs is inappropriate. 
For these populations, we assume that in the absence of disease all contacts will have the same expected 
duration but different individuals will create new contacts at different rates, resulting in some having higher 
or lower average degrees. In [5], we used $\kappa$ to be the expected degree of a node. However, when behavior 
changes based on infection status, the expected degree of individuals can change. Instead, we refer to $\kappa$ as 
the desired degree because, depending on how sorting happens, it may not be possible for a node to have 
expected degree $\kappa$. However, $\kappa$ will represent the expected degree of an individual if there were no infection 
present. We again use $S$ and $I$ to be proportions of the population. $\Pi_S$ and $\Pi_I$ measure the proportion of 
desired contacts which belong to susceptible or infected individuals: $\Pi_S = \int s_\kappa \kappa \rho(\kappa)\, d\kappa / \langle K \rangle$, $\Pi_I = \int i_\kappa \kappa \rho(\kappa)\, d\kappa / \langle K \rangle$, 
where $s_\kappa$ and $i_\kappa$ denote the proportion of individuals with desired degree $\kappa$ who are susceptible and infected 
respectively. 

We assume that the population behavior proceeds as before, but an uninfected node will end and form 
contacts with different rates for infected or susceptible neighbors. There are many ways in which this could 
be modeled. We will assume that a susceptible individual with desired degree $\kappa$ acquires new susceptible 
contacts at rate $\kappa \eta_{1,S} \Pi_S$ and new infected contacts at rate $\kappa \eta_{1,I} \Pi_I$, where both $\eta$ parameters may depend 
on $\Pi_S$ and $\Pi_I$. Similarly, a susceptible individual will end an existing contact with another susceptible and 
with an infected individual at rates $\eta_{2,S}$ and $\eta_{2,I}$ respectively (where again both $\eta$ parameters may depend 
on $\Pi_S$ and $\Pi_I$). We assume that $\eta_{1,S}$ and $\eta_{2,S}$ are equal if $\Pi_I = 0$ so that in a disease-free population an 
individual's expected and desired degrees coincide. 
Figure 13: Variable-degree serosorting model. Flow diagram showing the interplay involved in serosorting. 
We do not consider a recovered class, which simplifies the equations significantly. The framework can 
be adapted to include a recovered class. The $\Pi$ variables give the proportion of contacts that would be 
formed with susceptible or infected individuals assuming that their behavior is not altered by disease. The $\Phi$ 
variables are the probability that a current contact of the test node is with an individual of given type, under 
the assumption that the test node always behaves as if susceptible, and does not transmit to its neighbors. 
We expect that the edge breaking and rejoining rates $\eta$ will depend on values of $\Pi_S$ and $\Pi_I$. 



We need to add variables in order to track the probability of having existing edges connecting to 
susceptible or infected nodes. Consider a test node $u_1$ with desired degree $\kappa$, and another $u_2$ with desired 
degree $\kappa + \Delta\kappa$. We define $\Phi_S \Delta\kappa$ to be the expected additional number of edges to susceptible neighbors 
that $u_2$ would have and $\Phi_I \Delta\kappa$ to be the expected additional number of edges to infected neighbors which 
have not transmitted that $u_2$ would have. We take the values of $\Phi_S$ and $\Phi_I$ in the $\Delta\kappa \to 0$ limit. In the 
cases considered in [5], the values of $\Phi_S$ and $\Pi_S$ were the same. However, because there is active selection of 
neighbors based on disease status, in this case $\Phi_S \neq \Pi_S$. 

The resulting flow diagram is shown in figure 13. We must find the flux from $\Phi_S$ to $\Phi_I$. Consider a random 
test node $u$ and look at a randomly chosen susceptible neighbor $v$. Given the desired degree $\kappa_v$ of $v$, the rate 
at which $v$ becomes infected is $\beta \Phi_I \kappa_v$. We need to determine the expected value of $\kappa_v$ given that $v$ is a susceptible 
neighbor of $u$. We first note that the probability density function for the neighbor to be susceptible and 
have degree $\kappa$ is proportional to $q(\kappa) = e^{-\kappa(1-\Theta)} \kappa \rho(\kappa) / \langle K \rangle$ with some proportionality constant $a$. So in 
order to calculate the expected value of the desired degree we take $\int a \kappa q(\kappa)\, d\kappa / \int a q(\kappa)\, d\kappa$. This simplifies 
to $\Psi''(\Theta)/\Psi'(\Theta)$. So the flux from $\Phi_S$ to $\Phi_I$ due to infection of the neighbor is $\beta \Phi_S \Phi_I \Psi''(\Theta)/\Psi'(\Theta)$. 
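
The simplification to $\Psi''(\Theta)/\Psi'(\Theta)$ can be checked numerically. The sketch below assumes, purely for illustration, an exponential density $\rho(\kappa) = e^{-\kappa/m}/m$, for which $\Psi(x) = \int e^{-\kappa(1-x)} \rho(\kappa)\, d\kappa = 1/(1 + m(1-x))$ and hence $\Psi''(x)/\Psi'(x) = 2m/(1 + m(1-x))$. The function names and the midpoint quadrature are choices made for this example.

```python
import math


def rho_exponential(kappa, mean):
    """ASSUMED density of desired degrees: exponential with the given mean."""
    return math.exp(-kappa / mean) / mean


def expected_kappa_of_susceptible_neighbor(Theta, mean, n=20000):
    """Midpoint-rule estimate of  int kappa*q(kappa) dkappa / int q(kappa) dkappa."""
    decay = (1.0 - Theta) + 1.0 / mean   # overall exponential decay rate of q
    upper = 50.0 / decay                 # truncation point; the tail beyond is negligible
    h = upper / n
    num = 0.0
    den = 0.0
    for j in range(n):
        k = (j + 0.5) * h
        # q(kappa) up to the constant 1/<K>, which cancels in the ratio
        q = math.exp(-k * (1.0 - Theta)) * k * rho_exponential(k, mean)
        num += k * q * h
        den += q * h
    return num / den


def psi_ratio_exponential(Theta, mean):
    """Closed form Psi''(Theta)/Psi'(Theta) for the assumed exponential density."""
    return 2.0 * mean / (1.0 + mean * (1.0 - Theta))
```

For instance, `expected_kappa_of_susceptible_neighbor(0.8, 3.0)` and `psi_ratio_exponential(0.8, 3.0)` should agree to within the quadrature error.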

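We find 

$$ \dot{\Theta} = -\beta \Phi_I \tag{30} $$

$$ \dot{\Phi}_S = \eta_{1,S}(\Pi_S,\Pi_I)\,\Pi_S - \eta_{2,S}(\Pi_S,\Pi_I)\,\Phi_S - \beta \Phi_S \Phi_I \frac{\Psi''(\Theta)}{\Psi'(\Theta)} \tag{31} $$

$$ \dot{\Phi}_I = \eta_{1,I}(\Pi_S,\Pi_I)\,\Pi_I - [\eta_{2,I}(\Pi_S,\Pi_I) + \beta]\,\Phi_I + \beta \Phi_S \Phi_I \frac{\Psi''(\Theta)}{\Psi'(\Theta)} \tag{32} $$

$$ \Pi_S = \frac{\Psi'(\Theta)}{\Psi'(1)}, \qquad \Pi_I = 1 - \Pi_S \tag{33} $$

$$ S = \Psi(\Theta), \qquad I = 1 - S \tag{34} $$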

So an SI epidemic in a population with serosorting can be captured by a system with just three ODEs. 
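
As a rough illustration of how compact the system (30)-(34) is, the sketch below integrates the three ODEs with a hand-written Runge-Kutta step. The exponential distribution of desired degrees (giving $\Psi(x) = 1/(1 + m(1-x))$), the constant $\eta$ rates, the initial state, and all parameter names and values are assumptions made for this example. Note that $\Theta$ is not a probability in this formulation and may become negative as an SI epidemic progresses.

```python
def Psi(x, m):
    """Psi(x) for an ASSUMED exponential density of desired degrees with mean m."""
    return 1.0 / (1.0 + m * (1.0 - x))


def Psi_d1(x, m):
    return m / (1.0 + m * (1.0 - x)) ** 2


def Psi_d2(x, m):
    return 2.0 * m * m / (1.0 + m * (1.0 - x)) ** 3


def rhs(state, p):
    """Right-hand sides of eqs. (30)-(32); the eta rates are ASSUMED constant."""
    Theta, Phi_S, Phi_I = state
    m, beta = p["mean"], p["beta"]
    Pi_S = Psi_d1(Theta, m) / Psi_d1(1.0, m)   # eq. (33)
    Pi_I = 1.0 - Pi_S
    infect = beta * Phi_S * Phi_I * Psi_d2(Theta, m) / Psi_d1(Theta, m)
    return [-beta * Phi_I,
            p["eta1_S"] * Pi_S - p["eta2_S"] * Phi_S - infect,
            p["eta1_I"] * Pi_I - (p["eta2_I"] + beta) * Phi_I + infect]


def rk4_step(f, state, dt, p):
    """One classical Runge-Kutta step for the system state' = f(state, p)."""
    k1 = f(state, p)
    k2 = f([s + 0.5 * dt * k for s, k in zip(state, k1)], p)
    k3 = f([s + 0.5 * dt * k for s, k in zip(state, k2)], p)
    k4 = f([s + dt * k for s, k in zip(state, k3)], p)
    return [s + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
            for s, a, b, c, d in zip(state, k1, k2, k3, k4)]


def final_infected(p, state0=(0.999, 0.995, 0.004), t_end=10.0, dt=0.005):
    """Integrate to t_end and return I = 1 - Psi(Theta), eq. (34)."""
    state = list(state0)
    for _ in range(int(round(t_end / dt))):
        state = rk4_step(rhs, state, dt, p)
    return 1.0 - Psi(state[0], p["mean"])


# No serosorting: contacts with infected partners are treated like any other.
NO_SORTING = {"mean": 3.0, "beta": 1.0,
              "eta1_S": 1.0, "eta2_S": 1.0, "eta1_I": 1.0, "eta2_I": 1.0}
# Strong serosorting: no new contacts with infected partners, existing ones end quickly.
STRONG_SORTING = {"mean": 3.0, "beta": 1.0,
                  "eta1_S": 1.0, "eta2_S": 1.0, "eta1_I": 0.0, "eta2_I": 10.0}
```

Comparing `final_infected(NO_SORTING)` with `final_infected(STRONG_SORTING)` gives a quick sense of how much the assumed sorting behavior suppresses spread under these illustrative rates.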
5 Discussion 



We have applied the edge-based compartmental model approach introduced in [5] to diseases and populations 
with different structures. With the exception of serosorting we focused our attention on static CM networks. 
We have considered each variation in isolation. However, it is possible to adapt the approach to a disease for 
which several of these issues are considered simultaneously in any of the network classes discussed in [5]. 

In general, we can adapt most existing mean-field/mass-action style SIR models in a closed population 
to the spread of infectious disease through a network. When we do this, we get a $\phi$ variable corresponding 
to each of the S, I, or R variables in the usual model. We take the usual flow diagram for S, I, and R and 
adapt it to give the fluxes between the $\phi$ variables. We add one more compartment $1 - \theta$, and flux goes from 
each of the potentially infectious $\phi$ variables to $1 - \theta$. This approach produces an accurate model for disease 
spread through the modeled population. 